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We present a search for the standard model Higgs boson (H) in pp collisions at ^fs = 1.96 TeV 
in events containing a charged lepton (I), missing transverse energy, and at least two jets, using 
5.4 fb _1 of integrated luminosity recorded with the DO detector at the Fermilab Tevatron Collider. 
This analysis is sensitive primarily to Higgs bosons produced through the fusion of two gluons or two 
electroweak bosons, with subsequent decay H — > WW — > ivq'q, where I is an electron or muon. The 
search is also sensitive to contributions from other production channels, such as W H — > lubb. In the 
absence of signal, we set limits at the 95% C.L. on the cross section for H production cr(pp — > H + X) 
in these final states. For a mass of Mh = 160 GeV, the limit is a factor of 3.9 larger than the cross 
section in the standard model, and consistent with expectation. 

PACS numbers: 13.85.Rm, 14.80.Bn 



The Higgs mechanism [l|-|4| accommodates the ob- 
served breaking of electroweak (EW) symmetry in the 
standard model (SM). In addition to generating masses 
for the EW W and Z bosons, as well as for fermions, 
the theory predicts a new scalar Higgs boson (H) with 
well-determined couplings, but unknown mass (Mh)- 
Confirmation of the existence and properties of the H 
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boson would be a key step in elucidating the origins 
of EW symmetry breaking. For a Higgs boson with 
mass Mh ^ 135 GeV, the dominant decay mode is 
H — > W + W~ , where at least one W boson must be vir- 
tual when Mh < 2Mw- Previous searches jH-0] for this 
process were based on events with two charged leptons (£) 
and large missing transverse energy ($t) from the decay 
H -> W+W~ -> ivl'v 1 (£ = e,/x). This Letter presents 
the first search for production of Higgs bosons with sub- 
sequent decay to WW having only one charged lepton 
in the final state. The data sample is 5.4 fb _1 of in- 
tegrated luminosity in pp collisions at -y/s = 1.96 TeV 
recorded with the DO detector at the Fermilab Tevatron 
Collider. The largest SM contributions to the inclusive 
cross section for producing H bosons in pp collisions are 
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from mechanisms involving the fusion of two gluons or 
two weak vector bosons into an H boson, and associated 
production of H and a weak vector boson (V = W or Z). 
In the following we will not distinguish between par- 
ticles and antiparticles. The most striking signatures 
from H — > VV decays are in the purely leptonic final 
states, but these account for only 5% of such decays. Fi- 
nal states containing a single charged lepton have larger 
backgrounds, but their branching fractions are a factor 
of s» 6 larger than for the all-leptonic modes. 

A recent calculation of the differential width for 
H — » WW — > ivq'q decays Q supports the important 
role of these mixed modes for characterizing a potential 
SM Higgs-boson signal. Our analysis is most sensitive to 
final-state topologies with a single lepton (e or /i), two 
or more jets, and $t, arising from H — > WW — > ivq'q 
decays. For Mjj ^ 140 GeV, significant sensitivity is 
gained from WH — > ivbb, where we do not attempt to 
identify the b quark flavor. Smaller signal contributions 
from H — > ZZ — > il^qq, where / represents an unidenti- 
fied lepton, and H — > WW — > rvq' q with r — > Ivv are 
also included. For Mh > 160 GeV, assuming that the 
observed Ifir is due to the neutrino from the decay of a W 
boson, it is possible to reconstruct the longitudinal mo- 
mentum (p v z ) up to a twofold ambiguity, and thereby ex- 
tract the mass of the H decaying to WW [§] . We choose 
the solution with smallest |Re(p^)| to calculate Mh- The 
primary backgrounds are from V+jets, top quark, dibo- 
son production, and multijet (MJ) events containing a 
lepton or lepton-like signature with J£t generally arising 
from mismeasurement of jet energies. 

The DO detector [l(| consists of tracking, calorimetric 
and muon detectors. Charged particle tracks are recon- 
structed using silicon microstrip (SMT) detectors and a 
scintillating fiber tracker, within a 2 T solenoid. Three 
uranium/liquid- argon calorimeters measure particle en- 
ergies that are reconstructed into hadronic jets using an 
iterative midpoint cone algorithm with a cone radius of 
0.5 |ll|]. Electrons and muons are identified through as- 
sociation of charged particle tracks with clusters in the 
electromagnetic sections of the calorimeters or with hits 
in the muon detector, respectively. We obtain the $t 
from a vector sum of transverse components of calorime- 
ter energy depositions and correct it for identified muons. 
Jet energies are calibrated using transverse momentum 



balance in photon+jet events 12j, and the correction 
is propagated to the $t- The data are recorded using 
triggers designed to select single electrons or muons and 
combination of electron and jets. After imposing data 
quality requirements, the total integrated luminosity is 
5.4 fb _1 [13J, where the first 1.1 fb _1 , Run Ha, precedes 
an upgrade to the SMT and trigger systems. The re- 
maining 4.3 fb _1 is denoted as Run lib. The four data 
sets using e or /i for the two run epochs are analyzed 
separately and combined in the final result. 

Background contributions from most SM processes are 



determined through Monte Carlo (MC) simulation, while 
multijet background is estimated from data. The dom- 
inant background is from V+jets processes, which are 
generated with ALPGEN [14]. The transverse momentum 
(pr ) spectrum of the Z boson in the MC is reweighted 
to match that observed in data . The pr spectrum of 
the W boson is reweighted using the same dependence, 
but corrected for differences between the pr spectra of 
Z and W bosons predicted in next-to-next-to-leading or- 
der (NNLO) QCD [lf|. Backgrounds from tt and elec- 
troweak single top quark production are simulated us- 
ing the ALPGEN and COMPHEP 17| generators, respec- 
tively. Vector boson pair production and H boson signals 
are generated with pythia 18]. All these simulations 
use CTEQ6L1 parton distribution functions (PDF) [lj| . 
Both ALPGEN and COMPHEP samples are interfaced with 
pythia for modeling of parton evolution and hadroniza- 
tion. 

Relative normalizations for the various V+jets pro- 
cesses are obtained from calculations of cross sections at 
ncxt-to- leading order using MCFM [20] , while the absolute 
normalization for the total V+jets background is con- 
strained through a comparison to data, following the sub- 
traction of other background sources. This increases the 
normalization for V+jets background by about 2%, com- 
pared with the expectation from ALPGEN normalized us- 
ing total cross sections calculated at NNLO [2l[ with the 



MRST2004 NNLO PDFs [22J. Cross sections for other 
SM backgrounds are taken from Ref. [23j, or calculated 
with MCFM, and those for signal are taken from Ref. fiij ]. 
The pt spectra for diboson events in background are cor- 
rected to match those of the MC@NLO generator [25j] . The 
Pt spectra from the contribution of gluon fusion to the 
H boson signal, generated in pythia, are modified to 
match those obtained from SHERPA [261 ] - 

Signal and background events from MC are passed 
through a full GEANT3-based simulation [27} of detector 
response, then processed with the same reconstruction 
program as used for data. Events from randomly selected 
beam crossings with the same instantaneous luminosity 
profile as data are overlaid on the simulated events to 
model detector noise and contributions from the pres- 
ence of additional pp interactions. Paramctcrizations of 
trigger efficiency for leptons are determined using Z — > ££ 
decays [28| . Any remaining differences between data and 
simulation in the reconstruction of electrons, muons, and 
jets are adjusted in simulated events to match those ob- 
served in data and these corrections are propagated to 
the MET. 

Events are selected to contain candidates for W — > Iv 
decay by requiring $t > 15 GeV and the presence of 
a lepton with pt > 15 GeV that is isolated relative to 
jets, namely located outside jet cones AR(£,j) > 0.5, 
with (AR) 2 = {<j) e - cj>>) 2 + {jf - Ty^^where <j> x and 
rf are the azimuth and pseudorapidity [29( of object x. 
The pp interaction vertex (PV) position along the beam 
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TABLE I: Number of signal and background events expected after selection requirements. The signal sources include gluon- 
gluon and vector-boson fusion, and associated production WH. The three numbers quoted for the signals correspond to 
Mh = 130, 160, and 190 GeV. For backgrounds, "top" includes pair and single top quark production and "VV" includes all 
non-signal diboson processes. The overall background normalization is fixed to the data by adjusting the V+jets cross sections. 



Channel 


99 -> H 


qq -> qqH 


WH 


V+jets 


Multijet 


Top 


VV 


Total Background 


Data 


Electron 


11.2/46.3/27.8 


2.1/6.4/4.2 


7.2/0/0 


52158 


11453 


2433 


1584 


67627 


67627 


Muon 


9.5/34.7/20.4 


1.5/4.4/2.9 


5.7/0/0 


47970 


2720 


1598 


1273 


53562 


53562 



direction (zpv) is required to be reconstructed within 
the longitudinal acceptance of the SMT, |zpy| < 60 cm. 
The lepton is required to originate from the PV and to 
pass more restrictive isolation criteria based on tracking 
information and energy deposited near the lepton in the 
calorimeter. Electrons must also satisfy criteria for the 
spatial distribution of the shower, and timing information 
is used to reject cosmic ray background in events with 
muons. All lepton selections are described in Ref. [3fjj ]. 
except that this analysis requires both the scalar sum of 
track px and calorimeter energy in the vicinity of the 
muon be less than 2.5 GeV. Electrons and muons are 
required to be located within |?7det| < 1-1 and < 1.6, 
respectively, where rjdet is the pseudorapidity assuming 
the object originates from the center of the detector. To 
reduce background from Z — > ££, top quark, and diboson 
events, and to assure selected events do not overlap with 
those used in the WW —> tvl'v 1 analysis channels, we 
veto any event containing an additional lepton satisfying 
less stringent identification criteria. We also require at 
least two jets with \rf\ < 2.5 and pr > 20 GeV that 
contain associated tracks originating from the PV. The 
jet pr requirement is 23 GeV when the second-leading jet 
(ordered in pr) has 0.8 < \i]d e t\ < 1.5 [loj| . The leading 
two jets are used to reconstruct the W boson deca yin g 



to q'q. To suppress background from MJ events [31|, 
we require events to have Mjf (GeV 
where M^f is the transverse mass [J 
candidate. 



> 40 - 0.5 x $ T , 
| of the W boson 



To estimate the MJ background, we use data sam- 
ples orthogonal to our signal sample. For the electron 
channel, we form a "loose" category for which the selec- 
tion on a likelihood discriminant used to select a "ti ght " 
electron, based on calorimeter and track variables }31j . 
is reversed. Following the method of Ref. (33[, the MJ 
background is evaluated from independently-determined 
probabilities for loose electrons or jets to pass the tight 
signal selections. For the muon channel, we reverse re- 
quirements on muon isolation in both the tracking de- 
tectors and the calorimeters, and subtract contributions 
arising from SM processes containing a true muon from 
W or Z decay. The normalization is obtained from fits 
to both the y+jets and MJ contributions using observed 
distributions of p 1 ^ and $t- Event yields in data and 
those expected for signal and background are shown in 
Table H 



We use a random forest (RF) of 50 decision trees (DT) 
to separate signal from background 34, 35|. Each DT is 
trained on a randomly selected collection of signal and 
background MC events as well as MJ events from data. 
The DTs examine a random set of about 30 discriminat- 
ing variables formed from particle 4- vectors, angles be- 
tween objects, and combinations of kinematic variables 
such as reconstructed masses and event shapes. An RF is 
trained separately for each data set, using signal hypothe- 
ses 115 < M H < 200 GeV in steps of 5 GeV. The outputs 
of the final RF discriminants for the four data sets com- 
bined, background, and signal for Mji = 160 GeV are 
shown in Fig. Q] . Agreement is observed with expecta- 
tions from SM background, and the RF-output distribu- 
tions are therefore used to set upper limits on the cross 
section for SM Higgs production. 
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FIG. 1: The output of RF discriminants for data, different 
backgrounds, and signal for Mh = 160 GeV for the combined 
data sets. 

Systematic uncertainties affect the normalizations and 
distributions of the final discriminants and are included 
in the determination of limits. These arise from a vari- 
ety of sources, and their impact is assessed by changing 
each input discriminant to the RF by ±1 standard devi- 
ation. The most significant uncertainties are from cali- 
bration of jet energies (0.7-6)%, jet resolution (0.5-3)%, 
jet reconstruction efficiency (0.5-4)%, lepton identifica- 
tion and modeling of the trigger (4%), estimation of the 
multijet background (6.5-26)%, and integrated luminos- 
ity (6.1%). Theoretical uncertainties on cross sections for 
backgrounds are taken from Ref. 0, [Hf . The uncertain- 
ties on cross sections for signal are taken from Ref. [Hj]. 
Because the overall cross section for F+jets production is 
constrained by data, the uncertainty on its normalization 



G 



is anti-correlated with the MJ background. The impacts 
of theoretical uncertainties on distributions of the final 
discriminants are assessed by varying a common renor- 
malization and factorization scale, by comparing ALPGEN 
interfaced with herwig 36] to ALPGEN interfaced with 
pythia for V+jets samples, and by varying PDF pa- 
rameters using the prescription of Ref. [19( for all MC 
samples. 

Upper limits on the production cross section multiplied 
by branching fractions are determined using the modified 
frequentist CL$ approach [37|. A test statistic based on 
the logarithm of the ratio of likelihoods (LLR) (3?} for the 
data to represent signal+background and background- 
only hypotheses is summed over all bins of the final dis- 
criminant in each set. To minimize degradation in sensi- 
tivity, scaling factors for the systematic uncertainties are 
fitted to the data by maximizing a likelihood function 
for both the signal+background and background-only hy- 
potheses, with the systematic uncertainties constrained 
through Gaussian priors on their probabilities 38] . Cor- 
relations among systematic uncertainties in signal and 
background are taken into account in extracting the fi- 
nal results. Figure [2] shows the combined background- 
subtracted data and the uncertainties on the RF discrim- 
inant after they are fitted to the data. 
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FIG. 2: The combined background subtracted data and one 
standard deviation (s.d.) uncertainty on total background af- 
ter applying constraints on systematic uncertainties by fitting 
to data. The expected SM Higgs signal for M H = 160 GeV, 
shown by the line, is scaled up by a factor of 5. 

The resulting limits on standard model Higgs boson 
production are given in Table [II] The LLRqbs values 
shown in Fig. [3] as functions of Mh are within ~ 1.5 
standard deviations of the expected median for LLRb, 
the background-only hypothesis, as calculated from sta- 
tistical fluctuations and systematic uncertainties. 

In conclusion, we have determined the first limits on 
standard model Higgs boson production examining de- 
cays of the Higgs boson to two vector bosons, one of 
which decays leptonically and the other into a pair of 
quarks. For Mh = 160 GeV, the observed and expected 
95% C.L. upper limits on the combined cross section 
for Higgs production, multiplied by the branching frac- 
tion for H + X —> £ + / / v + qq, are factors of 3.9 and 5.0 
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FIG. 3: The observed LLRs for the combined data sets are 
given by the solid line, the expected LLRs for the background- 
only and signal+background hypotheses are shown as dots 
and dashes, respectively, and the dark and light-shaded areas 
correspond to one and two standard deviations (s.d.) around 
the expected LLRs for the background-only hypothesis. Neg- 
ative values of LLRobs represent signal-like fluctuations in 
the data. 



larger than the SM cross section, respectively. 

Supplementary material, including a list of variables 
used in the RF, samples of input distributions, and a 
table of systematic uncertainties, is available from [39j . 
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TABLE III: Variables used in the random forest classifiers, given in categories of object kinematics, angular variables, and 
event kinematics. The symbol (*) denotes a measurement in the rest frame of the reconstructed H boson. A subset of these is 
used for each channel and data-taking period. 



Object Kinematics 

Lepton energy E l 



Lepton/jet transverse momentum 


Pt,Pt,Pt 






Azimuthal angles 


Act>(ji,j 2 ),Act>(£,E T ), 




min [ A<t>(p T , ji ) , A<j,(E T , h )] , min [ A<t>(£, 3 \ ) , A<f>(£, j 2 )] 




A0(bisector of dijet pair, $t) 


Polar angles 


Ar/(j'i, fc) 


rj — <f> combinations 


AR(j 1 J 2 ),max[AR(£,j 1 ),AR(£,j 2 )],mm[AR(E T ,j 1 ),AR(E T ,j 2 )] 


3D angles 


angle (ji,j 2 )\ angle (Ji, reconstructed W — > iv)^ , angle(^, dijet system) 


Event Kinematics 




Velocity 


/3(dijet system) 


Transverse momentum sums 


^T,PT(dijet system) , pt (leptonic W) 




H T (ji,h), H T (all jets), H T (ji,j 2 ,£), H T {ji,j 2 ,£,E T ) 


Topological variables" 


aplanarity(ji,j' 2 ,4 v) 




centrality(£, ji , j 2 ) , centrality(£, all jets) 




sphericity(£, j\ ,j 2 ), sphericity (£, j\ , j 2 , E T ) 


"N"-body masses 


dijet mass : M(j 1 ,j 2 ), M(j 1 ,j 2 ,£), M(WW), WW "visible mass": M(ji,j 2 ,£,E T ) 


Transverse masses 


M T (Ji,h), M T (W -> £u), M T {WW) 


Ratio of jet energies 


E 32 / E 31 t 


^■^imax/min 


Afl(ji,ja) • E T (j 1/2 )/(E e T +E T ) 


Other combinations 


scalar product of flr and angular bisector of dijet pair 




A<j)(W e , bisector of dijet system) 




Magnitude of pt J1 perpendicular to dijet system 


Scaled E T 


Y, [v / ^rsin(^)cos(A^.(i„^ T ))] 2 




all jets 



"For descriptions of the topological variables see: V.D. Barger and R.J.N. Phillips, Collider Physics, Addison- Wesley, Reading, MA, 1987. 
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FIG. 4: Sample of DT inputs for (a,c,e) electron and (b,d,f) muon channels. The data are shown as points with uncer- 
tainties. The background contributions are shown as histograms, with sources indicated in the legend. "V+jets" includes 
(W/Z) + (u, d, s, c,b, g) jets, MJ is the multijet background, "top" includes pair and single top quark production, diboson 
includes WW, WZ, and ZZ processes. The distributions for signal are multiplied by a factor of 200. 
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TABLE IV: Systematic uncertainties for the electron and muon channels. Signal uncertainties are shown for Mh = 160 GeV 
for all channels except for WH, shown for Mh = 115 GeV. Those affecting the shape of the RF discriminant are indicated 
with "Y." Uncertainties are listed as relative changes in normalization, in percent, except for those also marked by "S," where 
the the overall normalization is constant, and the value given denotes the maximum percentage change from nominal in any 
region of the distribution. 



Contribution 


Shape 


W+jets 


Z±jets 


top 


diboson 


gg^H 


qq -> qqH 


WH 


Jet energy scale 


Y 


(+6.7\ a 

1-5.4/ 


< 0.1 


±0.7 


±3.3 


(-+5.7N 


±1.5 


/+2.7N 
\-2.Z) 


Jcli lU-cllLlllCetLlUll 


Y 


+fi fS s 
xu.u 


<r fl 1 

\ U.J- 






+1 n 


±1.1 


+1 n 


Jet resolution 


Y 




< 0.1 


±0.5 


CSS) 


1-0.5/ 


±0.8 


±1.0 


Association of jets with PV 


Y 


±3. 2 s 


±1.3 S 


±1.2 


±3.2 


±2.9 


±2.4 


(+0.9N 
1-0.2/ 


Luminosity 


N 


n/a 


n/a 


±6.1 


±6.1 


±6.1 


±6.1 


±6.1 


Muon trigger 


Y 


±0.4 S 


< 0.1 


< 0.1 


< 0.1 


< 0.1 


< 0.1 


< 0.1 


Electron identification 


N 


±4.0 


±4.0 


±4.0 


±4.0 


±4.0 


±4.0 


±4.0 


Muon identification 


N 


±4.0 


±4.0 


±4.0 


±4.0 


±4.0 


±4.0 


±4.0 


ALPGEN tuning 


Y 


±l.l s 


±0.3 S 


n/a 


n/a 


n/a 


n/a 


n/a 


Cross Section 


N 


±6 


±6 


±10 


±7 


±10 


±10 


±6 


Heavy-flavor fraction 


Y 


±20 


±20 


n/a 


n/a 


n/a 


n/a 


n/a 


PDF 


Y 


±2.0 S 


±0.7 S 


< 0.1 s 


< 0.1 s 


< 0.1 s 


< 0.1 s 


< 0.1 s 



Electron channel Muon channel 

Multijet Background Y ±6.5 ±26 
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FIG. 5: Ratio of the 95% CL limit to the SM expectation as a function of Mh for the electron and muon channels combined. 
Expected limits are shown by the dashed line, observed limits are shown by the solid line. The heavy and light shaded areas 
correspond to one and two standard deviations (s.d.) around the expected result for the background-only hypothesis. 



